Assessing diagnostics for fault tolerant software

نویسنده

  • John Napier
چکیده

Reliability is of prime importance in computer-based safety critical systems where failure can lead to fatal consequences. Fault tolerant techniques in software have a vital role to play, because veri cation and validation techniques cannot guarantee that software is error free. Fault tolerance further improves the reliability of the system by ensuring it continues to operate safely when residual software errors are encountered. On line diagnosis is a critical aspect of software fault tolerance. At the present time however, there is a lack of any real guidance or understanding of how in practice to use online diagnostics e ectively and e ciently. For a particular program design it is di cult to reason about what an e ective diagnostic strategy should be, because our current level of understanding of software design errors is so poor. This thesis proposes that through a controlled process of experimentation, aimed at investigating the way software behaves in the presence of simulated faults, our understanding can be improved. In this thesis an empirical method is developed, and demonstrated, which aims to increase our understanding of the key factors in uencing the fault detection capabilities of on-line diagnostics. The experiments presented illustrate the potential of this approach and provide new insights of signi cance into the relevance of these factors. These experiments lay the foundations for a longer term progressive and controlled process of experimentation. Only by continuing the experimental process in this way will it be possible to move towards a better understanding of how to design e ective diagnostics for fault tolerant software.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Coverage-based testing strategies and reliability modeling for fault-tolerant software systems

Software permeates our modern society, and its complexity and criticality is ever increasing. Thus the capability to tolerate software faults, particularly for critical applications, is evident. While fault-tolerant software is seen as a necessity, it also remains as a controversial technique and there is a lack of conclusive assessment about its effectiveness. This thesis aims at providing a q...

متن کامل

Assessing the Reliability of Diverse Fault-Tolerant Systems

Design diversity between redundant channels is a way of improving the dependability of software-based systems, but it does not alleviate the difficulties of dependability assessment. Assuming failure independence between channels is unrealistic. Using statistical evidence from realistic testing, standard inference procedures can estimate system reliability, but they take no advantage of a syste...

متن کامل

A Distributed Fault Tolerant Architecture for Nuclear Reactor Control and Safety Functions

A new fault tolerant architecture that provides tolerance to a broad scope of hardware, software, and communications faults is being developed. This architecture relies on widely commercially available operating systems, local area networks, and software standards. Thus, development time is significantly shortened, and modularity allows for continuous and inexpensive system enhancement througho...

متن کامل

N - version design Versus one Good Version

Software Diversity as a way of achieving high reliability of software Software diversity has long been seen as way of achieving higher reliability of software than is attainable by a single software version subjected to a heroic testing. The known experiments with software diversity confirm that indeed fault-tolerant software employing diversity is "on average" more reliable than a single softw...

متن کامل

Implementing Software On-line Diagnostics in Safety Critical Systems

Current design and assessment methods cannot ensure that software is error free and consequently fault tolerant techniques play an important role in safety critical systems to ensure reliable software behaviour. Unfortunately general guidelines for the implementation of software diagnostics are lacking. The aim of this work is to build on the previous research in software error detection and th...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2001